Evolutionary Applications 



Evolutionary Applications ISSN 1752-4571 



ORIGINAL ARTICLE 

Multiple introductions and recombination in Cryphonectria 
hypovirus 1: perspective for a sustainable biological control 
of chestnut blight 

Nicolas Feau, 1,2,3 Cyril Dutech, 1,2 Jeremie Brusini, 1,2,4 Daniel Rigling 5 and Cecile Robin 1,2 

1 INRA, UMR1202 BIOGECO, F-33610, Cestas, France 

2 University Bordeaux, BIOGECO, UMR 1202, F-33400, Talence, France 

3 TAIGA-Lab, Forest Sciences Centre, University of British Columbia, #3618-2424 Main Mall, Vancouver, BC, V6T 1Z4, Canada 

4 Department of Ecology and Evolutionary Biology, Earth and Marine Sciences Building, University of California, Santa Cruz, CA, 95064, USA 

5 WSL Swiss Federal Research Institute, CH-8903, Birmensdorf, Switzerland 



Keywords 

biological control, disease biology, host 
parasite interactions, invasive species, 
microbial biology 

Correspondence 

Cecile Robin, INRA, UMR1202 BIOGECO, 
F-33610 Cestas, France. 
Tel.: +33 5 57 12 27 20; 
fax: +33 5 57 12 28 81; 
e-mail: robin@bordeaux.inra.fr 

Received: 20 December 2013

In Revised form: 3 March 2014; Accepted: 6 March 2014

doi:10.1111/eva.12157



Abstract 

Cryphonectria hypovirus 1 (CHV1) is a mycovirus which decreases the virulence 
of its fungal host Cryphonectria parasitica, the causal agent of chestnut blight 
recently introduced in Europe. The understanding of the evolutionary processes 
which have shaped CHV1 populations in Europe is required to develop a sustain- 
able biocontrol strategy targeting chestnut blight and effective in European chest- 
nut forests. To retrace the evolutionary history of CHV1, we analyzed sequences 
from two genomic regions on a collection of 55 CHV1 strains from France and 
northern Spain, two countries where multiple introductions of C. parasitica 
occurred. Several recombination events and variable selection pressures contrib- 
uted to CHV1 evolution, agreeing with a non-clock-like diversification rate. 
These two mechanisms may be at the origin of CHV1 population diversity 
observed in western Europe. Considering the actual prevalence of CHV1 and its 
association with host genotypes, multiple introductions of CHV1 may have 
occurred in Europe, some of them directly from Asia and some of them through 
North America. Although some viral strains remained with low frequency in their 
introduction area, multiple infections might have allowed homologous recombi- 
nation within parental sequences. Some of these recombinant lineages are associ- 
ated with the spread of CHV1 in European regions. 



Introduction 

There is now growing evidence that global change could 
boost chronic impacts of plant pathogens, for example 
through climate warming that releases constraints on their 
development or through transport of exotic species (Paut- 
asso et al. 2012). This prediction tends to be confirmed in 
forests where several new tree diseases emerged during the 
last two decades. In this context, biological control appears
to be a suitable strategy to contribute to sustainable forest 
management. The use of natural enemies to regulate patho- 
gen populations and to maintain them under an acceptable 
economic threshold of damage can be achieved by different
methods: the classical method is based on the release and 
establishment of non-native natural enemies of exotic 



pathogens, whereas methods of augmentation and conser- 
vation aim at increasing the effectiveness of natural ene- 
mies already present in the environment (Eilenberg et al. 
2001). Mycoviruses, which decrease fungal virulence, are 
seen as promising agents to mitigate fungal tree diseases. 
Such viruses, infecting and replicating in pathogenic fungi, 
cause fungal debilitation, which results in an overall
decrease in reproduction rates and/or vegetative growth, 
and are, thus, beneficial to the host plant (Pearson et al. 
2009). For example, mycoviruses could contribute to the 
attenuation of disease outbreaks after pathogens are introduced
into a new area, as has been suggested for chestnut
blight and for Dutch elm disease (Milgroom 1999). 
Although mycoviruses are widespread within the fungal 
phylogeny (Ghabrial and Suzuki 2009; Pearson et al. 2009), 






© 2014 The Authors. Evolutionary Applications published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative 

Commons Attribution License, which permits use, distribution and reproduction in any medium, provided 

the original work is properly cited. 






studies on genetic and phenotypic diversity remain limited 
to very few mycoviruses (Voth et al. 2006; Bryner et al.
2012), and we are far from being able to predict how a mycovirus,
introduced into a 'new' host population, will evolve. As a
consequence, the use of mycoviruses as biological control 
agents requires an understanding of their ecology and 
evolution (Roderick et al. 2012). 

Several points need to be examined before predicting the 
success of mycoviruses as biocontrol agents. On one hand, 
a biosafe candidate for biocontrol must exhibit a limited 
host range, avoiding its spread over to unexpected host 
targets. Mycoviruses are thought to have evolved in close 
association with their hosts (Ghabrial and Suzuki 2009) 
and have a limited host range, most often limited to one 
host genus (e.g., Liu et al. 2003). On the other hand, bio- 
control agents must be able to establish and spread within 
the targeted host population. With the exception of free 
viral particles (Yu et al. 2013), mycoviruses are generally 
transmitted by an intracellular process, ensuring successful 
spread within the host population. Intracellular transmis- 
sion may involve either vertical transmission through fun- 
gal spores (Pearson et al. 2009) or horizontal transfer by 
way of anastomosis and heterokaryosis events (Leslie
1993). As the success of anastomoses in fungi is controlled
by the vegetative incompatibility gene system (i.e., vic or
het genes; Saupe 2000), efficiency of mycovirus spread can 
be directly dependent on the interactions of these genes 
within the host population (Cortesi et al. 2001). 

Some mycoviruses have been detected in large geo- 
graphic areas, as illustrated by Cryphonectria hypovirus 1 
(CHV1) the type species of hypoviruses (Hypoviridae fam- 
ily) which infects the fungal pathogen Cryphonectria parasi- 
tica, the causal agent of chestnut blight around the world. 
The fungus is native to eastern Asia; its introduction to North America
at the end of the 19th century resulted in the near elimination
of American chestnut (Castanea dentata). In Europe,
where C. parasitica jumped to a new host (the European 
chestnut, C. sativa), several introduction events have 
occurred since the beginning of the 20th century, directly 
from Asia or via North America (Robin et al. 2009; Dutech 
et al. 2010, 2012). The low impact of C. parasitica in Eur- 
ope could be partly explained by the introduction of CHV1 
(Milgroom and Cortesi 2004). The typical 'hypovirulence' 
phenotype, induced by CHV1 in C. parasitica, encom- 
passes several changes in the biology and physiology of its 
host and results in the survival of trees infected by such 
virus-infected fungal strains (Shapira et al. 1991; Nuss 
2005). Hypovirulent C. parasitica strains were isolated for 
the first time in 1964 in Italy and then in France. CHV1 
was later detected in C. parasitica in Japan and China 
where it had most likely originated (Peever et al. 1998). 
CHV1 is now detected in almost all European regions 
where C. parasitica was introduced, but so far has never 



been detected in American C. parasitica populations, 
except in a few sites where it has been intentionally intro- 
duced (Milgroom and Cortesi 2004). CHV1 is grouped 
within the positive-stranded RNA viruses and has a 12 712-nt
genome which contains two open reading frames (ORFs) 
encoding multifunctional polyproteins (Nuss 2005; Ghab- 
rial and Suzuki 2009). Based on variation found within 
both ORFs, five different subtypes have been characterized 
with sequence divergence levels ranging from 11% to 19% 
(Gobbin et al. 2003). Subtype I is widespread in southern 
and southeastern Europe from southeastern France to 
Turkey (Sotirovski et al. 2006; Robin et al. 2010; Krstin 
et al. 2011). Phylogeny and time estimates suggest that 
CHV1 was introduced into Italy together with C. parasitica 
and spread across south-central Europe and then eastern 
Europe (Bryner et al. 2012). By comparison, the distribu- 
tions of the other subtypes are much more geographically 
restricted. Subtypes F1 and F2 were first found in France
and recently in Spain (Zamora et al. 2012) and Turkey 
(Akilli et al. 2013); subtypes E and D were detected in 
Spain and Germany, respectively (Gobbin et al. 2003; 
Peters et al. 2012). 

Very soon after the discovery of hypovirulent isolates, 
Grente and Berthelay-Sauret (1978) demonstrated that 
inoculation of cankers by compatible hypovirulent isolates 
resulted in canker healing. They suggested using a yearly 
release of hypovirulent strains as a treatment against chest- 
nut blight in orchards. This biological control method has 
been successfully applied in southern France for 40 years 
(Grente and Berthelay-Sauret 1978). However, biocontrol 
with CHV1 still needs to be improved to reduce the
density and impact of C. parasitica populations without 
continuous human assistance. Indeed, after an experimen- 
tal release of hypovirulent strains in northern Switzerland 
where no hypovirulence was detected, CHV1 subtype I 
quickly established and spread (Hoegger et al. 2003), 
whereas in southern France, introduced subtype F1 was
outcompeted by naturally occurring subtype I strains
(Robin et al. 2010). Variability in fitness components
within CHV1 strains and genotype × genotype × environment
interactions might affect the outcome of biocontrol
(Peever et al. 2000; Brusini 2009; Robin et al. 2010;
Bryner and Rigling 2011). Moreover, CHV1 transmission 
processes via spores and distance of migration as well as 
transmission between vegetative groups are far from being understood
in natural populations. Asexual spores, in which
CHV1 is vertically transmitted, have short distance dis- 
persal. Ascospores, which are wind dispersed, have never 
been found infected by CHV1 (Carbone et al. 2004; Pros- 
pero et al. 2006). Phylogenetic studies (Carbone et al. 
2004) and experimental in planta assays (Brusini and Robin 
2013) indicated that the ability of CHV1 to disseminate 
among fungal populations composed of several vegetative 
compatibility (vc) types might have been underestimated
by in vitro studies, which suggested that heteroallelism at vic loci
strongly reduces virus transmission (Cortesi et al. 2001).
There is also a need to expand biological control to forest 
plantations and coppices and to European areas where 
chestnut blight recently emerged but where CHV1 has not
yet become established, for example northern France (De Villebonne
1998), Portugal (Braganca et al. 2007), northern
Switzerland (Hoegger et al. 2000) and part of southwestern 
Germany (Peters et al. 2012). In these regions, chestnut 
blight impact is high and the development of a sustainable 
biological method has been requested by stakeholders. 

The primary objective of our study was to assess the 
genetic diversity and structure of populations of CHV1 
sampled in a geographic area where multiple introduction 
events were detected in C. parasitica. Our hypothesis is that 
genetic structure of the mycovirus is highly spatially corre- 
lated with the introduced host populations. The second 
objective was to study evolutionary processes (recombina- 
tion, mutation, migration and selection) which could gen- 
erate or maintain variation in CHV1 occurring in Europe. 
New viral genotypes can be created through recombination 
events after multiple introductions and be beneficial for 
biological control of chestnut blight. So far, evolutionary 
changes have only been inferred from areas where only one 
CHV1 subtype (subtype I) occurs (Bryner et al. 2012). To 
understand the evolutionary history of CHV1 in Europe, 
investigations were carried out in areas with higher CHV1 
subtype diversity and different host populations. 

Materials and methods 

Fungal isolates 

Isolates used in this study were obtained from untreated 
cankers during previous surveys for C. parasitica (Robin 
et al. 2000; Breuillin et al. 2006; Dutech et al. 2008; Robin 
et al. 2009; Dutech et al. 2010). From this collection, we 
analyzed 49 fungal isolates exhibiting the white phenotype 
(reduced pigmentation and sporulation) usually associated 
with a CHV1 infection: five isolates were from southeastern 
France (SEF), 24 isolates from central France (CF), 11 
isolates from central Pyrenees in France (CPyr), and nine 
isolates from the Atlantic Pyrenees in Spain and France 
(APyr) (Fig. 1, Table 1). In addition, five CHV1 strains 
(named as [fungal isolate]), previously assigned to subtype
F1 ([2091] and [48.2D]), subtype F2 ([2103]), and subtypes
E and D (strains [M1147] and [M1372]), were included in
our sample (Gobbin et al. 2003). Finally, this heterochro- 
nous sampling of CHV1 strains was completed with data 
from two isolates available in Genbank: the complete 
genomes of the viral strains [EP713] (Shapira et al. 1991) 
and [Euro7] (Chen and Nuss 1999). EP713 is an American 
C. parasitica isolate (EP155) infected by the hypovirus that 



was originally obtained in 1966 from southeastern France 
(Robin et al. 2010; S. Anagnostakis, personal communica- 
tion). Euro7 is a naturally virus-infected isolate originating 
from Italy. Forty-five fungal isolates were previously 
genotyped with 10 microsatellite markers. They could be
assigned to one of the three genetic clusters defined among 
105 French genotypes known so far (Dutech et al. 2010). 
Nineteen isolates (from southeastern and central France) 
were grouped into the genetic cluster G1, whereas 20 and 4 isolates (from
the Pyrenees area and central France) belonged to clusters
G2 and G3, respectively.

DsRNA isolation, cDNA synthesis and sequencing 

Cryphonectria parasitica isolates were grown on cellophane- 
covered PDA plates for 7 days at 25°C in the dark. The 
mycelium was scraped from the cellophane, freeze-dried 
and ground to a fine powder in a Hybaid Ribolyser 
(Hybaid, Teddington, Middlesex, UK). Viral dsRNA was 
extracted from these powders (20-50 mg) using a CF-11 
cellulose chromatography method (Allemann et al. 1999).
One microliter (approximately 50 ng) of dsRNA solution was
lyophilized together with 0.5 µg random primers (Promega,
Madison, WI, USA), and cDNA was synthesized as
described by Allemann et al. (1999).

Two regions of the CHV1 genome were sequenced. The 
first one, from ORF A (positions 1495-1792 in the nucleotide
sequence of CHV1/EP713), was amplified with primers




Figure 1 Sampling localities of Cryphonectria hypovirus 1 strains in 
four regions: southeastern France (SEF), Central France (CF), Central
Pyrenees (CPyr), and Atlantic Pyrenees (APyr). Pie charts represent the 
frequency of the main viral lineages (A1B1, A1B2, A2B1, A3B3, A4B1; 
one color for each, all other lineages are represented in grey) in the four 
regions. 
Table 1. Origin and genetic clusters of Cryphonectria parasitica (Cp) isolates infected by Cryphonectria hypovirus 1 (CHV1), viral lineages and recombination
events detected with RDP3 within two genomic regions of CHV1.

[Table body not recovered. Surviving column headers: Cp isolate; Region; Locality; GPS coordinates (Lambert II); Year of ...]

SEF, southeastern France; CF, central France; CPyr, central Pyrenees; APyr, Atlantic Pyrenees.
*Genetic clusters were defined in Dutech et al. (2010).
†NR: non-recombinant viral strain.



hvep1 and hvep2 (Gobbin et al. 2003). For this region,
sequencing of both strands was performed at the Swiss Fed- 
eral Research Institute (WSL) using an ABI Prism 3130 
Genetic Analyzer (Life Technologies, Carlsbad, CA, USA). 
The second one, from ORF B (positions 6252-6991 in the
nucleotide sequence of CHV1/EP713), was amplified with
primers developed for this study: ORF B-12F (5'-ATCGGGTCTCCCTTCAAGTT-3')
and ORF B-12R (5'-CACGACGAGTTCGTTGAGRA-3'). In this case, sequencing of
one strand was performed by GATC-biotech (http://www.gatc-biotech.com).
The annealing temperatures for amplification
with the ORF A and ORF B primer sets were 55°C
and 58°C, respectively.

Base calls were checked for a minimum
Phred quality score of >20, chromatograms
were inspected for ambiguous nucleotides, and sequences were then
aligned using ClustalW (Thompson et al. 1994). The ORF
A alignment comprised 298 bp. Although ORF B-12F
and ORF B-12R amplified a cDNA fragment of about
780 bp, the flanking sequences were removed and only
591-bp sequences were used for analyses of this genomic
region.
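As an illustration of the quality filter mentioned above, the sketch below converts a Phred score Q into its base-calling error probability, 10^(-Q/10), and flags bases that do not exceed the threshold of 20. The function names and the example quality values are hypothetical, and this is not the pipeline used in the study.

```python
def phred_to_error_prob(q):
    """Convert a Phred quality score Q into the base-calling error probability."""
    return 10 ** (-q / 10)


def low_quality_positions(quals, threshold=20):
    """Return 0-based indices of bases whose Phred score does not exceed the threshold."""
    return [i for i, q in enumerate(quals) if q <= threshold]


# Hypothetical per-base quality scores for a short read (illustration only)
example_quals = [35, 40, 18, 22, 9, 30]
flagged = low_quality_positions(example_quals)
```

A Phred score of 20 corresponds to an expected error rate of one miscalled base in 100.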

Phylogenetic analyses 

For each ORF, base content, chi-squared test of homogene- 
ity of base frequencies across sequences and distances were 
computed, and phylogenetic relationships among haplo- 
types were inferred using maximum-parsimony (MP) 
method with PAUP ver. 4.0b10 for PC (Swofford 2003).
MP trees were generated through heuristic searches with 
1000 random stepwise additions, with tree bisection- 
reconnection branch swapping and saving multiple parsi- 
monious trees (MULTREES turned on). Nonparametric 
bootstrap support values for each dataset were estimated 
by performing 1000 heuristic searches with 100 random 
stepwise additions and all other settings as above. The 
homoplasy matrices generated in PAUP were visually 
inspected to locate parts of trees where there was phylo- 
genetic conflict and to identify the haplotypes responsible 
for this conflict. 



Recombination analysis 

A phylogenetic network was constructed from the concate- 
nated alignment of ORF A and ORF B. The network 
was obtained by the neighbor-net method implemented 
in SPLITSTREE v4.11.3 (Huson 1998). To confirm the 
occurrence of recombination within CHV1, localization 
of recombination breakpoints and identification of likely 
parental sequences were carried out with RDP3 (Martin 
et al. 1998). Several recombination detection methods are 
implemented by this program: RDP, Geneconv, Bootscan, 
Maxchi, Chimaera, Siscan and 3seq. Default settings were 
used throughout. Firm recombination patterns were 
those detected by seven different methods, after Bonfer- 
roni correction for multiple comparisons. The likely 
parental sequences are the non-recombinant sequences 
which had regions the most similar to recombinant 
sequences. 

Estimation of divergence dates 

Analyses were conducted on subsets of ORF A and ORF B 
alignments (ORF A sh and ORF B s h) including only non- 
recombinant sequence blocks to avoid recombination bias. 
The molecular clock hypothesis was first tested by using the 
likelihood ratio test (LRT) to compare the null hypothesis 
of a molecular clock to the alternative hypothesis that lin- 
eages evolve at different rates (Hasegawa et al. 1985). The 
HKY model was the best model for both data subsets using 
the Akaike information criterion (AIC) in jModelTest ver.
0.1.1 (Posada 2008). Second, 'clock-likeness' of the ORF 
A sh and ORF B sh phylogenetic signals was assessed with the 
Path-O-Gen software (http://tree.bio.ed.ac.uk/software/pathogen/)
by calculating the coefficient of variation (CV)
of distance (i.e., the ratio of the standard deviation to the mean of
the distances) from the tree's ancestral root to each of the
terminal branch tips. The CV was taken as an indication of 
how clock-like the evolution was: the lower the coefficient, 
the more clock-like the evolution. Third, the marginal 
likelihoods of the strict and of the relaxed clock models 
(uncorrelated lognormal and exponential models) were 
compared within a Bayesian coalescent framework by a
Markov Chain Monte Carlo method using the Bayesian
Evolutionary Analysis by Sampling Trees program BEAST
(http://beast.bio.ed.ac.uk; Drummond and Rambaut 2007)
utilizing the HKY model of nucleotide substitution and a
prior substitution rate of 2.56 × 10⁻⁵ substitutions/site/year
(95% CI, 0.0-1.70 × 10⁻⁴) as determined previously for
CHV1 (Gobbin et al. 2003). The best fitting model was
selected by calculating the Bayes factors [log10(BF)] using
the method developed by Drummond and Rambaut (2007)
and described in Data S1.
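The root-to-tip criterion described above reduces to a simple ratio. The following minimal sketch computes the coefficient of variation of a list of root-to-tip distances; it assumes the sample standard deviation is used (the estimator applied by Path-O-Gen is not stated here), and the function name is hypothetical.

```python
import statistics


def root_to_tip_cv(distances):
    """Coefficient of variation (standard deviation / mean) of root-to-tip distances.

    Assumption: the sample standard deviation (n - 1 denominator) is used.
    Lower values indicate more clock-like evolution.
    """
    mean = statistics.mean(distances)
    sd = statistics.stdev(distances)
    return sd / mean
```

With identical root-to-tip distances for every tip, the CV is zero, which is the expectation under a perfectly strict clock with contemporaneous sampling.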

Identification of most probable location state for lineages 

To determine the probable geographical origin of the dif- 
ferent CHV1 lineages detected in the six sampling regions, 
we used a model of diffusion to discrete states to infer max- 
imum clade credibility trees (Lemey et al. 2009). We first 
assigned sequences from the same approximate region 
(based on sampling geographical coordinates) to one of the 
six discrete sampling regions by using the R libraries 'geosphere'
and 'cluster' (see Table S1 for details). Then, based on
the results obtained in the previous clock-model analyses, 
continuous-time models were calibrated for the ORF A sh 
and ORF B sh datasets under a constant population size tree 
prior and an exponential relaxed molecular clock. Individ- 
ual BEAST runs were performed with 200-million steps in 
the Markov chain and sampling every 10 000 steps to pro- 
duce a posterior tree distribution containing 20 000 geneal- 
ogies. The maximum clade credibility tree (a point estimate 
of the tree with the highest cumulative posterior probabili- 
ties in the posterior distribution of trees) was annotated 
with geographical locations using the software TreeAnnotator
(available in the BEAST package). For each node with a
Bayesian posterior probability higher than 0.5, posterior
state probabilities (PSP) were assigned to the six sampling 
populations. 
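The grouping of sampling sites into discrete regions can be pictured with the sketch below. It is not the authors' R procedure: it swaps in a nearest-centroid rule based on great-circle (haversine) distance, assumes coordinates in decimal degrees, and uses hypothetical centroid values chosen only for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_region(lat, lon, centroids):
    """Return the name of the region whose centroid is closest to (lat, lon)."""
    return min(centroids, key=lambda name: haversine_km(lat, lon, *centroids[name]))


# Hypothetical region centroids (latitude, longitude); not taken from the study
REGION_CENTROIDS = {
    "SEF": (43.7, 6.0),
    "CF": (45.2, 1.5),
    "CPyr": (43.0, 0.5),
    "APyr": (43.2, -1.3),
}
```

Each discrete region label obtained this way would then serve as the trait value attached to a sequence in the discrete-state diffusion model.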

To avoid bias arising from unequal sampling size 
between locations, we homogenized the sample sizes for 
all locations from which more than eight sequences had 
been sampled, by randomly sub-sampling eight sequences 
from each of these. Sequences from Italy and Germany,
with only one sequence in each, were discarded for these
analyses. For each of the 25 smaller datasets constructed, 
we performed the same BEAST-phylogeographic analyses 
as those described above. Each time a tree node was
found to be supported by a Bayesian posterior probability
>0.5, the PSP for each of the four sampling regions
(central France, central Pyrenees, Atlantic Pyrenees and
southeastern France) were collected. This resulted, for
each node, in four distributions of probability values
that were compared using a one-way ANOVA; a post hoc
Tukey's HSD test was performed when the sampling
region effect was significant. Finally, to obtain
greater confidence, the ORF A sh dataset was enriched
with additional sequences from Italy (n = 3), southern
Switzerland (n = 15) and France (n = 2) previously pub- 
lished (Gobbin et al. 2003). Clock-model and phylogeo- 
graphic analyses were carried out with this larger dataset 
as described above. 
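The equal-size resampling described above can be sketched as follows. This is a minimal illustration under stated assumptions: locations with more than eight sequences are randomly reduced to eight, locations with a single sequence are dropped, and 25 replicate datasets are drawn. Function names, the seed and the input structure are hypothetical.

```python
import random


def subsample_by_location(seqs_by_location, max_n=8, min_n=2, rng=None):
    """Return one dataset in which over-sampled locations are reduced to max_n sequences.

    Locations with fewer than min_n sequences (singletons) are discarded.
    """
    rng = rng or random.Random()
    dataset = {}
    for location, seqs in seqs_by_location.items():
        if len(seqs) < min_n:
            continue  # discard locations represented by a single sequence
        if len(seqs) > max_n:
            dataset[location] = rng.sample(seqs, max_n)  # sampling without replacement
        else:
            dataset[location] = list(seqs)
    return dataset


def replicate_datasets(seqs_by_location, n_rep=25, seed=0):
    """Build n_rep independently subsampled datasets from one seeded generator."""
    rng = random.Random(seed)
    return [subsample_by_location(seqs_by_location, rng=rng) for _ in range(n_rep)]
```

Fixing the seed makes the 25 replicates reproducible, which helps when the downstream phylogeographic runs need to be repeated.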

Positive selection 

To test for positive selection in the ORF A sh and ORF B sh
subsets, we estimated the rate of non-synonymous
nucleotide substitutions per non-synonymous site (dN),
the rate of synonymous nucleotide substitutions per synonymous
site (dS) and the ratio ω = dN/dS, with the
method of Nielsen and Yang (1998) implemented in the
CODEML program of the PAML package
(Yang 1997). PhyML ver. 3.0 (Guindon et al. 2010) was
used to estimate the branch lengths of the phylogenetic
trees of the subset phylogeny, which were used as starting
values for CODEML. Two models of codon substitution
(M2a and M8) that allow for heterogeneous selection
pressures among sites were compared to null models
(homogeneous rate among sites, models M1a and M7) as
described in Data S2.
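Nested site models such as M1a versus M2a and M7 versus M8 are commonly compared with a likelihood ratio test using two degrees of freedom. The sketch below assumes that convention (the exact procedure is the one given in Data S2, which is not reproduced here) and uses the closed form of the chi-squared survival function for two degrees of freedom, exp(-x/2). The function name and the log-likelihood values in the usage line are hypothetical.

```python
import math


def lrt_two_df(lnl_null, lnl_alt):
    """Likelihood ratio test for nested models differing by two parameters.

    Returns the statistic 2 * (lnL_alt - lnL_null) and its p-value under a
    chi-squared distribution with 2 degrees of freedom, whose survival
    function is exp(-x / 2).
    """
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)  # guard against small negative values
    p_value = math.exp(-stat / 2.0)
    return stat, p_value


# Hypothetical log-likelihoods for a null (e.g. M7) and alternative (e.g. M8) model
stat, p_value = lrt_two_df(-1005.0, -1000.0)
```

A small p-value would indicate that allowing a class of sites with ω > 1 fits the data significantly better than the null model.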

Results 

Sequence analysis 

We obtained unambiguous sequences for ORF A and ORF 
B for all but one viral strain ([Doi9]). The ORF A sequence 
of this strain could not be analyzed because of mixed 
sequence data. To test the hypothesis of an infection by 
more than one viral strain, mono-conidial isolates of this 
fungal culture were grown, and CHV1 strains infecting this 
progeny were analyzed. Two different viral strains were 
detected: one related to [EP713] and one related to 
[Euro7], confirming the double infection hypothesis of 
the Doi9 isolate. This isolate was then excluded from the 
analyses. We constructed an alignment of the 55 partial 
sequences of the ORF A and ORF B genomic regions of 
CHV1, including sequences retrieved from the NCBI 
database. 

In the ORF A region (298 nt), 112 sites were polymor- 
phic giving a rate of 0.384 substitutions per site. Among the 
55 sequences, 40 haplotypes were identified. The most 
frequent haplotype was detected in seven fungal isolates: 
five from one location in central France (Doi6, Doi40, Doi460,
Doi474 and Doi494) and two from the Pyrenees range (Lan6
and Arn11). Three other haplotypes were also found in
fungal hosts from different geographic regions. In contrast, 
four other haplotypes were found in isolates originating 
from the same geographic location. In the ORF B region 
(591 nt), 166 sites were polymorphic and the substitution
rate per site was 0.286 for the whole sample. Forty-nine 
haplotypes were defined from ORF B, with four haplotypes 
representing two or three viral strains. Two strains ([2103] 
and [Gon37]) shared the same haplotype for both ORFs.
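Counts of polymorphic sites and haplotypes such as those reported above can be obtained directly from an alignment. The sketch below is a simplified illustration: it assumes equal-length, gap-free sequences with no ambiguity codes, the toy alignment is hypothetical, and the proportion of variable columns it yields is not necessarily the per-site statistic reported in the text.

```python
def count_polymorphic_sites(alignment):
    """Number of alignment columns containing more than one distinct character."""
    return sum(1 for column in zip(*alignment) if len(set(column)) > 1)


def count_haplotypes(alignment):
    """Number of distinct sequences in the alignment."""
    return len(set(alignment))


# Hypothetical toy alignment (four sequences, six sites)
toy_alignment = ["ACGTAC", "ACGTAC", "ACCTAC", "ATCTAA"]
n_polymorphic = count_polymorphic_sites(toy_alignment)
n_haplotypes = count_haplotypes(toy_alignment)
proportion_variable = n_polymorphic / len(toy_alignment[0])
```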

Phylogenetic relationships between CHV1 strains with 
maximum-parsimony analysis 

The analysis of nucleotide frequencies over all codon 
positions suggested a compositional bias toward G+C 
bases. However, chi-squared tests of homogeneity of base 
frequencies across taxa were non-significant within 1st, 
2nd and 3rd codon positions (Fig. S1). Of the three
codon positions, third position nucleotides were system- 
atically more variable than those of first and second posi- 
tions as measured by percentage of sites that were 
variable and the percentages of sites that were parsimony 
informative (data not shown). To test the hypothesis of 
saturation of the phylogenetic signal at third codon posi- 
tions, we calculated and plotted uncorrected distances 
and corrected distances between each pair of sequences. 
For each dataset, high slopes (above 0.65, except for first
codon positions in ORF B) coupled with significant linear
relations (all r² > 0.97) indicated little saturation at
these codon positions (Fig. S1). Thus, nucleotide composition
and saturation analyses showed that uninformative
data should not adversely affect the phylogenetic reconstruction
analysis (Fig. S1).
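The saturation check described above can be sketched as a regression of uncorrected on corrected pairwise distances. The example below makes two assumptions that are not stated in the text: the correction is the Jukes-Cantor (JC69) formula, and the uncorrected distance is placed on the y-axis so that a slope close to 1 indicates little saturation. All function names are hypothetical.

```python
import math
from itertools import combinations


def p_distance(a, b):
    """Uncorrected distance: proportion of sites that differ between two sequences."""
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def jc69_distance(p):
    """Jukes-Cantor corrected distance (defined only for p < 0.75)."""
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def slope_and_r2(xs, ys):
    """Least-squares slope of ys on xs and the squared correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx, sxy ** 2 / (sxx * syy)


def saturation_fit(alignment):
    """Regress uncorrected (y) on JC69-corrected (x) distances over all sequence pairs."""
    uncorrected = [p_distance(a, b) for a, b in combinations(alignment, 2)]
    corrected = [jc69_distance(p) for p in uncorrected]
    return slope_and_r2(corrected, uncorrected)
```

To examine a single codon position, the alignment passed to `saturation_fit` would first be reduced to every third column starting at the position of interest.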

Maximum-parsimony analysis with heuristic searching 
in PAUP resulted in 150 and 105 trees for ORF A and ORF 
B respectively, from which consensus trees were built (Fig. 
S2). Using only the first and second codon positions (i.e.,
discarding positions that could have been subjected to multiple
substitutions; see above), the topologies and resolutions
reflected those obtained for all codon positions (data
not shown).

For ORF A, five main clusters were resolved and supported
with bootstrap values >0.85 (Fig. S2, Table 1). Viral
strains defined as subtypes F1 ([EP713], [2091] and
[48.2D]), F2 ([2103]) or I ([Euro7]) were enclosed within
clusters A1 (26 strains), A2 (nine strains) and A3 (10
strains), respectively. However, strains [M1147] and
[M1372] (subtypes E and D, respectively) grouped in one
cluster (A4), with six other viral strains. The fifth cluster
(A5) was represented by a unique viral haplotype, infecting
two fungal isolates from the Navarre province in Spain
(Sun9 and Lab7).

In ORF B, only three clusters, supported by strong bootstrap
values (100%), were resolved (Fig. S2, Table 1). All
strains that clustered together in A1 and A3 also grouped
together in clusters B1 and B3, respectively. However, there
were inconsistencies between the ORF A and ORF B phylogenies.
The A2 strains clustered with B1 strains, and the A4
strains ([Sar15] and [Doi1]) clustered in B3. The A5 strains
([Sun9] and [Lab7]) were grouped with four A1, one A2
and one A4 strains in the B2 cluster.

Visual inspection of the homoplasy matrices generated 
in PAUP for the ORF A and ORF B MP trees helped to 
identify putative recombinants (Fig. S2). In each dataset, 
the distributions of homoplasy values were trimodal with 
values around zero, indicating little phylogenetic conflict, 
and higher values in the range of 10-20 and >45, indicative 
of more conflict. Similar patterns of phylogenetic conflict 
were found when the codon positions potentially saturated 
(third position) were discarded from the analyses (data not 
shown). In ORF A, conflicts were found between taxa of
the A4 and A1 groups (homoplasy values ranged between
10 and 18), A1 and A2 groups and A1 and A4 groups
(values between 46 and 56). Similarly for ORF B, values
ranged between 70 and 90 indicating conflicts between taxa
of the B1 and B2 groups.

Recombination and definition of CHV1 main genetic 
clusters 

The concatenation of the two ORF regions and the split 
decomposition analysis of the concatenated sequence 
resulted in a phylogenetic network, in which the sequences 
were linked to each other by several branches suggesting a 
complex recombination history (Fig. 2). Viral strains with 
B1 sequences were clearly separated from strains having B2
and B3 sequences (Fig. 2). Parallel edges were also depicted
between strains A1, A4 and A5. Five main clusters
gathered 89% of the concatenated sequences (clusters A1-B1
represented by 22 strains, A3-B3 by 10 strains, A2-B1 by 8
strains, A4-B1 by 5 strains and A1-B2 by 4 strains; Table 1).

Four firm recombination events were detected with the
RDP3 program (Table S1, Fig. 3). They resulted in 38
recombinant strains (out of 55, i.e., 69%). The two main
clusters identified with the phylogenetic network (A1-B1
and A3-B3) gathered recombinant strains resulting from
the events E1 and E2 (Fig. 2). A1-B1 strains were mainly
located in central France and in the Pyrenees region (20
strains among 22). A striking result is the absence of A3-B3
strains from the western part of the study area. In 
sequences from clusters A2-B1, A1-B2 and A4-B1, no firm 
recombination pattern was detected. Therefore, all these 
strains, coming mainly from the Pyrenees, were considered 
non-recombinant strains. 

Among the 45 fungal host isolates assigned to one of the 
genetic pools defined for C. parasitica, 39 were infected by 
viruses belonging to one of the five main lineages. The 
relationship between viral and host genotypes was statistically
significant (Fisher's exact test: P = 0.044 × 10⁻⁶; two-sided
probability = 0.0005). All viral strains belonging to the
Figure 2 The phylogenetic network resulting from the split decomposition of concatenated sequences from ORF A and ORF B of Cryphonectria
hypovirus 1 using the neighbor-net method implemented in SPLITSTREE v4.11.3.



lineage A3-B3 were found in the fungal G1 gene pool and
all A1-B2 strains in the G2 gene pool (Table 2). By
contrast, A1-B1 viral strains were found in all three fungal
pools.

Estimation of divergence dates 

Based on the recombination events previously identified, 
two non-recombinant data subsets (ORF Ash and ORF Bsh) 
were reconstructed (Fig. 4). Likelihood ratio tests (LRTs) rejected 
the null hypothesis of a homogeneous rate of evolution among 
lineages for the two subsets (Table 3). For ORF Ash, rejection 
of the null hypothesis was supported by a high degree of 
deviation from the strict molecular clock model in the 
root-to-tip test (CV = 0.16) and a significantly higher marginal 
likelihood value for the relaxed uncorrelated exponential 
molecular clock (log10 BF >1.3 when compared to the 
strict clock model; Table 4, Table S2). For ORF Bsh, deviation 
from the strict clock model was less clear. On one 
hand, the low CV value of 0.10 and a highly significant 
LRT did not indicate a clock-like evolution of this subset. 
On the other hand, differences in the marginal likelihood 
value between the strict and relaxed molecular clock 
models were below the generally accepted significance 
threshold of 1.3 (log10 BF = 0.49, Table 3). 
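
As a worked illustration of the model comparison above, the following sketch converts two marginal log-likelihoods into a log10 Bayes factor and compares it with the 1.3 threshold. It assumes the marginal likelihoods listed in Table 3 are natural logarithms; the function name is hypothetical, and only the ORF Ash values from Table 3 are used.

```python
import math


def log10_bayes_factor(lnl_alternative: float, lnl_null: float) -> float:
    """log10 Bayes factor from two natural-log marginal likelihoods."""
    return (lnl_alternative - lnl_null) / math.log(10)


# Marginal log-likelihoods for ORF Ash as listed in Table 3
# (assumed to be natural logarithms).
LNL_STRICT = -760.1
LNL_RELAXED = -756.4
THRESHOLD = 1.3  # significance threshold quoted in the text

bf_orf_a = log10_bayes_factor(LNL_RELAXED, LNL_STRICT)
relaxed_favoured = bf_orf_a > THRESHOLD
print(f"ORF Ash: log10 BF = {bf_orf_a:.2f}, relaxed clock favoured: {relaxed_favoured}")
```

With these rounded inputs the result is about 1.61, close to the tabulated value of 1.62; the small difference presumably comes from rounding of the tabulated likelihoods.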

Three demographic models (i.e., constant size, 
exponential size and expansion size) were also tested for 
each clock model fitted to the data subsets (Table S2). 
Marginal likelihood analyses showed that no other demographic 
model fit the ORF Ash and ORF Bsh data significantly 
better than the constant size population model [log10(BF) 
always <0.94], suggesting that there could be insufficient 
information in the data to fit a more complex demographic 
model. 

Finally, using the relaxed exponential clock model with a constant 
population size, average evolution rates were 2.483 × 10⁻⁴ 
[95% credibility intervals (CI) of 0.289 × 10⁻⁴ to 
4.929 × 10⁻⁴] and 1.716 × 10⁻⁴ (95% CI, 0.303 × 10⁻⁴ to 
3.410 × 10⁻⁴) substitutions per site per year for ORF Ash 
and ORF Bsh, respectively (Table S3). Under these rates, the 
tMRCA obtained for the different virus lineages with the 
two data subsets were mostly consistent, except for the age 
of the most recent common CHV1 ancestor [605 years for 
ORF Ash (1406, 95% CI, -809/+498 years) and 903 years 
for ORF Bsh (1108, 95% CI, -1332/+735)] (Fig. 4). The 
analysis of the full dataset (i.e., ORF A + ORF B), including 
interlineage recombinants with highly divergent CHV1 
genes, resulted in an estimation of the time to the most 
recent common CHV1 ancestor of 110 years before (i.e., at 
998; 95% CI, -1353/+816) the date estimated with 
the ORF Bsh dataset (Fig. S3). 
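
The relation between node ages, calendar dates and the asymmetric credibility intervals quoted above can be sketched as follows. The reference year 2011 is not stated in this section; it is inferred here from the reported pairs of values (605 years and 1406, 903 years and 1108) and should be read as an assumption, as should the function names.

```python
REFERENCE_YEAR = 2011  # assumption: inferred from the reported age/date pairs


def age_to_year(age_years: float, reference_year: int = REFERENCE_YEAR) -> int:
    """Convert a node age (years before the reference year) to a calendar year."""
    return round(reference_year - age_years)


def interval_bounds(year: int, minus: int, plus: int) -> tuple:
    """Turn a 'year, -minus/+plus' credibility interval into (lower, upper) years."""
    return (year - minus, year + plus)


root_orf_a = age_to_year(605)   # most recent common ancestor, ORF Ash
root_orf_b = age_to_year(903)   # most recent common ancestor, ORF Bsh
ci_orf_a = interval_bounds(root_orf_a, 809, 498)
ci_orf_b = interval_bounds(root_orf_b, 1332, 735)
print(root_orf_a, ci_orf_a)
print(root_orf_b, ci_orf_b)
```

Read this way, both intervals for the root extend from well before the second millennium into the nineteenth or twentieth century, which illustrates how wide the uncertainty on the oldest node is.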

Internal nodes differentiating sister CHV1 lineages 
(though generally weakly resolved; but see below) are 
dated from 1659 for the B2-B3 node (ORF Bsh) to 
1713 for A2-A3 (ORF Ash), with the 95% upper CI generally 
overlapping the first half of the twentieth century 

Figure 3 (A) Firm recombination events detected (E1, E2, E3 and E4) within 55 viral sequences of Cryphonectria hypovirus 1 by using seven methods 
of the RDP3 software. (B) Putative scenarios of the recombination events with putative parental and resulting recombinant CHV1 lineages identified 
by split decomposition (cf. Fig. 2). 



(1937 and 1940 for B2-B3 and A2-A3, respectively, Fig. 4). 
The tMRCA for clusters A3 and B3 (all strains A3 are B3) 
were established at 1912 and 1920 (with 95% CI, -105/+63 
and -97/+51 years), respectively. Radiation times for 
the two lineages A1 (1945, 95% CI, -54/+29 years and 
1940, -47/+26) and A4 (1942, 95% CI, -92/+41 years) 
fit with the tMRCA range for the corresponding strains 
within the B1 lineage (1942, -76/+43). Unfortunately, the 



Table 2. Distribution of CHV1 lineages in its host (Cryphonectria parasitica) genetic clusters. 

                          Viral lineages†
Host genetic clusters*    A1-B1   A3-B3   A2-B1   A4-B1   A1-B2   Others
G1                        8       8       1                       2
G2                        8               5       3       4       2
G3                        1                       1               2
ND                        5       2       2       1               0

*Defined in Dutech et al. (2010), ND: not determined. 

†Lineages defined after concatenation of ORF A and ORF B sequences. 



phylogenetic resolution obtained for the MP consensus 
tree with ORF A for the A1/A4 and A2 clusters (bootstrap 
support of 100%) was not obtained with the BEAST 
analysis on the ORF Ash subset, preventing dating the 
tMRCA for this cluster. Despite this, radiation of the A2 
viral lineage was estimated at 1873 (95% CI, -150/+101 years) 
with this subset, 68 years (95% CI, -150/+101 years) 
after the tMRCA estimated for lineage B1 
with ORF Bsh (Fig. 4). 

CHV1 lineages' most probable location of origin 

Both ORF Ash and ORF Bsh provided congruent information 
about putative origins of the different lineages, with 
generally a greater resolution for ORF Ash (Fig. 5, Fig. S3). 
For the deepest nodes (i.e., those dated before 1850), 
similar PSP were generally assigned to the six sampling 
populations. Lack of phylogenetic structure at this level, 
consistent with the weak phylogenetic support observed, 
may explain the failure to clearly determine a probable 
origin for these nodes. Results from both ORFs agreed 
that the most recent common ancestor for 

Figure 4 Lineage divergence dates inferred from ORF Ash and ORF Bsh sequences of Cryphonectria hypovirus 1. The time-scale of evolutionary 
changes is indicated by the scale bar below the tree. Divergence dates with 95% CI are indicated above nodes; Bayesian posterior probabilities up to 
0.5 are indicated below nodes. 



lineages A3 and B3 likely emerged from southeastern 
France (PSP = 0.47 for lineage A3 and 0.37 for B3) or 
Italy (PSP = 0.26 for A3 and 0.19 for B3, Fig. 5). Re-analyzing 
ORF Ash with 18 additional sequences from 
Switzerland and Italy also indicated an eastern origin for 
the A3 lineage (PSP = 0.63 for Switzerland/Italy and 0.26 
for South-East; data not shown). A2 lineages likely originated 
from the central Pyrenees (PSP = 0.73, Fig. 5) and 
A4 and B2 lineages from the Atlantic Pyrenees 
(PSP = 0.53 and 0.34, Fig. 5). Origin for the A1 strains 
was less clear with 35% of the trees assessed during the 
analysis consistent with an ancestral sequence originating 
from southeastern France and 38% from central France. 
Similarly, the probable origin of the B1 strains could not 



Table 3. Molecular clock model fitting for the ORF Ash and ORF Bsh regions of Cryphonectria hypovirus 1. 

                                      LRT test†                         Molecular clock model‡
Region    Length   Root-to-tip CV*    -Ln H0     -Ln H1     P-value     Strict     Relaxed exponential   Log10 BF   Model selected§
ORF Ash   199      0.16               910.45     766.07     <0.0001     -760.1     -756.4                1.62       Relaxed exponential
ORF Bsh   383      0.043              1525.64    1351.98    <0.0001     -1392.8    -1389.1               1.58       Relaxed exponential

*Coefficient of variation, that is, the ratio of the standard deviation to the mean for the root-to-tip distances. 

†Likelihood ratio test between likelihood values obtained under the null hypothesis of a molecular clock and the alternative hypothesis in which each tree 
branch is allowed to vary independently. 

‡Likelihood values of the BEAST molecular clock models (strict molecular clock and relaxed exponential molecular clock) applied to the datasets. BF, 
Bayes Factor (log10) values between the likelihood values obtained for the strict and the relaxed molecular clock models. 
§Clock model selected for divergence time estimations. 

Table 4. Maximum likelihood tests of positive selection in two genomic regions (ORF Ash and ORF Bsh) of Cryphonectria hypovirus 1 (CHV1). 

Model             Model parameters                                       -lnL      Models comparison   2ΔL    Pr.     No. sites under positive selection (Pr. ω > 1)
ORF Ash
M1a (neutral)     p0 = 0.80, p1 = 0.20                                   714.73    M1a vs M2a          7.71   <0.01   1 (0.99)
M2a (selection)   p0 = 0.86, p1 = 0.13, p2 = 0.096, ω2 = 6.08            710.88
M7 (beta)         p = 0.008, q = 0.024                                   714.95    M7 vs M8            8.25   <0.01   1 (0.99)
M8 (beta + ω)     p0 = 0.99, p = 0.26, q = 0.96, p1 = 0.010, ω = 5.71    710.82
ORF Bsh
M1a (neutral)     p0 = 0.83, p1 = 0.17                                   1332.69   M1a vs M2a          7.16   <0.01   2 (0.99, 0.98)
M2a (selection)   p0 = 0.83, p1 = 0.16, p2 = 0.005, ω2 = 5.81            1329.11
M7 (beta)         p = 0.005, q = 0.02                                    1333.07   M7 vs M8            8.25   <0.01   3 (0.96, 0.99, 0.99)
M8 (beta + ω)     p0 = 0.99, p = 0.01, q = 0.08, p1 = 0.005, ω = 5.57    1328.95


be fully resolved. ORF Bsh indicated a probable origin 
from southeastern France and central France for half of 
these strains (PSP = 0.26 and 0.33, respectively) and central 
and Atlantic Pyrenees (0.35 and 0.30, respectively) for 
the other half (Fig. S4). 



Positive selection 

For both data subsets, LRT-values for comparing models 
M1a with M2a and M7 with M8 were statistically 
significant, suggesting that discrete models M2a and M8 




Figure 5 Posterior state probabilities of population origin (PSP) inferred from Cryphonectria hypovirus 1 ORF Ash (left panels) and ORF Bsh (right 
panels) datasets. (A) PSP-values obtained for the original datasets. (B) PSP-values obtained after resampling ORF Ash or ORF Bsh datasets 25 times 
(see Methods). Letters above columns indicate significant differences among posterior location probabilities means as determined by a Tukey's HSD 
test after significant one-way ANOVA. [Bar charts not reproduced. Legend entries as extracted: Ger., Italy, SEF, CF, C. Pyr, Atl. Pyr; lineage labels: B1, B2, B3.] 






© 2014 The Authors. Evolutionary Applications published by John Wiley & Sons Ltd 7 (2014) 580-596 



Feau et al. 



Evolutionary history of a mycovirus 



systematically fit these two data subsets better than the neutral 
models M1a and M7 (Table 4). Model M2a indicated 
that 0.8% and 0.4% of the sites were under strong positive 
selection with ω = 6.08 and ω = 5.75 in ORF Ash and ORF 
Bsh, respectively. Model M8 showed for the two subsets that 
about 99% of the sites had ω from a U-shaped beta distribution, 
and about 0.5-1.0% of sites were under strong 
diversifying selection with ω = 5.78 for ORF Ash and 5.58 
for ORF Bsh. In ORF Ash, under models M2a and M8, 
the empirical Bayes analysis identified one amino acid site 
under positive selection with >99% confidence. In ORF 
Bsh, the same two sites under positive selection identified 
with model M2a (98% and 99% confidence) were also identified 
with model M8 (>99% confidence). 
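
The likelihood ratio statistics behind these comparisons can be recomputed from the negative log-likelihoods in Table 4, as in the sketch below. The assignment of the first block of Table 4 to ORF Ash and the second to ORF Bsh is an assumption (it follows the ω2 = 6.08 value quoted for ORF Ash), and the function name is hypothetical. P-values are not recomputed, because the degrees of freedom used for each comparison are not stated in this section.

```python
def lrt_statistic(neg_lnl_null: float, neg_lnl_alternative: float) -> float:
    """2 * (lnL_alternative - lnL_null), given negative log-likelihoods."""
    return 2.0 * (neg_lnl_null - neg_lnl_alternative)


# (-lnL of null model, -lnL of alternative model), as listed in Table 4.
# Region labels are an assumption based on the values quoted in the text.
comparisons = {
    ("ORF Ash", "M1a vs M2a"): (714.73, 710.88),
    ("ORF Ash", "M7 vs M8"): (714.95, 710.82),
    ("ORF Bsh", "M1a vs M2a"): (1332.69, 1329.11),
    ("ORF Bsh", "M7 vs M8"): (1333.07, 1328.95),
}

statistics = {key: lrt_statistic(*values) for key, values in comparisons.items()}
for (region, test), value in statistics.items():
    print(f"{region}, {test}: 2*delta(lnL) = {value:.2f}")
```

The recomputed statistics agree with the tabulated 2ΔL values to within rounding of the log-likelihoods.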

Discussion 

Our sequence analysis of CHV1 strains from France and 
Spain revealed a high virus diversity and multiple introduc- 
tions of viral lineages. This result markedly contrasts with 
the CHV1 diversity in southern and southeastern Europe, 
where only one CHV1 subtype (subtype I) was found 
(Bryner et al. 2012). Sequencing more than one genomic 
region was necessary to discriminate lineages as illustrated 
by subtypes E and D (Gobbin et al. 2003), definitively split 
into lineages A4-B1 and A4-B2 by the use of ORF B information. 
Thus, five main divergent and four secondary viral 
lineages were detected within the CHV1 population from 
western Europe. Lineages A1-B1, A2-B1 and A3-B3 correspond, 
respectively, to the subtypes F1, F2 and I, which 
were previously defined by an RFLP approach and ORF A 
sequencing (Alleman et al. 1999; Gobbin et al. 2003). 

Our finding of one fungal isolate infected by two differ- 
ent viral lineages provides evidence that such multiple 
infections in C. parasitica may occur in populations where 
different viral lineages coexist. Multiple CHV1 infections 
have previously been suspected in one C. parasitica isolate 
(Elliston 1985), but not genetically resolved. Mixed virus 
infections could have a synergistic effect on both viruses, as 
observed in a double infection by CHV1 and MyRV1, a 
mycoreovirus (Sun et al. 2006), or a deleterious effect, as 
observed for a mixed infection by CHV3 (Smart and Fulbright 
1995). When different strains of the same virus coinfect 
the same host, hybrid molecules can be produced 
during the replication process resulting in homologous 
recombination. In this study, we clearly identified recombi- 
nation footprints in a majority of the viral strains (83%) 
within ORF A and ORF B, but also between the two ORFs. 
Two recombination events have resulted in two CHV1 lin- 
eages, which are widely spread and more frequent than the 
putative parental strains. The lineage A1-B1, comprising 
subtype F1 strains, was likely derived through recombination 
between ancestral strains in the A4-B1 and A1-B2 
lineages. This recombination event might have occurred in 
western France or in Spain where both parental lineages 
and the recombinant one are found. CHV1 viruses related 
to subtype F1 were also recently described in northern 
Spain indicating an expanded distribution range of this 
subtype (Zamora et al. 2012). The lineage A3-B3, corresponding 
to the widespread subtype I, could have resulted 
from a recombination event (E2) within ORF B involving 
strains from the A1-B2 lineage and at least one unknown 
strain. In contrast, the events E3 and E4 gave rise to two 
lineages detected with a low frequency. However, these 
recombination events might be more recent than the two 
previous ones, and the viruses may not have had enough 
time to spread efficiently. The emergence of hypovirulence 
observed in western and southern Europe associated with 
the spread of a few recombinant genotypes may have 
resulted from a beneficial effect of recombination events on 
CHV1 individuals. Experimental evidence has shown that 
recombination between two viral genomes (differing by 
20%) has more often a beneficial than deleterious effect 
(Vuillaume et al. 2011). Random point mutations are often 
deleterious or lethal; however, homologous recombination 
creates new genotypes resulting from the integration of 
genomic regions into recipient genomes, both of which 
have already been under selective pressure. Such a benefi- 
cial effect can be compared to the positive effect that 
hybridization can have on the fitness of herbivorous insects 
used as biocontrol agents (Szucs et al. 2012) or to the effect 
of genetic recombination on the adaptive potential of 
introduced invasive species (Lavergne and Molofsky 2007; 
Keller and Taylor 2010). 

However, recombination events make any inference 
about the origin and evolutionary trajectories of CHV1 lin- 
eages difficult. The size of the recombination blocks and 
breakpoint positions could not be determined without the 
sequence that lay between the two regions analyzed in our 
study (i.e., 4460 nt retrieved from the complete genome 
sequences of [Euro7] and [EP713]), and the inferred 
recombination scenarios (parental lineages, location and 
date of the recombination) must be considered with caution. 
To infer the evolutionary histories, we analyzed two 
genomic regions (Ash and Bsh) without significant evidence 
of recombination, although the possibility of undetected 
recombination events cannot be completely ruled out, 
especially for ORF B. The occurrence of ancient interlineage 
recombinants in sequence datasets might have biased 
the estimated age of the most recent common ancestor by 
pulling the estimated date deeper in the past (Awadalla 
2003; Duffy and Holmes 2008; Lefeuvre et al. 2010). Using 
the relaxed evolutionary clock model, we inferred a rate of 
substitution per site per year of 2.5 × 10⁻⁴ for ORF Ash 
and 1.7 × 10⁻⁴ for ORF Bsh. These values are similar to 
those previously estimated with a relaxed clock model 
applied on ORF A sequences (Bryner et al. 2012) and in 
the range of substitution rates of other RNA viruses (Duffy 
et al. 2008). Our data confirmed that the first divergence of 
the different lineages had occurred prior to the transconti- 
nental migration of C. parasitica (which did not occur 
before the end of the 19th century), similarly to what Gob- 
bin et al. (2003) suggested by using a different phylogenetic 
model. However, due to large confidence intervals, it is difficult 
to date, with precision, the radiation of CHV1. Taking 
into account ORF Ash, the radiation of the dominant 
lineages might coincide with the introduction of C. parasitica 
into America and Europe (around 1900 and 1930, 
respectively). 

CHV1 diversity appeared geographically structured, mir- 
roring the distribution of C. parasitica diversity. CHV1 
diversity was higher in the Pyrenees and central France 
regions where C. parasitica populations are genetically 
more diverse, and lower in southeastern France where vc 
type and genotype diversity of C. parasitica is also lower 
(Robin et al. 2000, 2009; Dutech et al. 2010). In the Pyrenees 
and central France, up to seven CHV1 lineages were 
detected (among 22 viral strains in each case). A close association 
was observed between the viral lineages A2-B1 and 
A4-B1 and C. parasitica gene pools G2 and G3, respec- 
tively. Bayesian analyses yielded a radiation occurring in 
the 1940s for A2 and A4 clusters and a most probable ori- 
gin in the central and Atlantic Pyrenees. Thus, all these 
results are in agreement with an introduction of these viral 
lineages in southern France or northern Spain directly from 
China or Japan, as it has been suggested for the C. parasiti- 
ca gene pools G2 and G3 (Dutech et al. 2012). Imported 
Asian chestnut plants resistant to ink disease (Heiniger and 
Rigling 1994; Robin et al. 2009) might have introduced 
both C. parasitica and its hyperparasitic virus into this 
area. The lineage A1-B1 appeared similarly associated with 
the gene pool G2 mainly in central Pyrenees and central 
France and to G1 mainly in central France. One hypothesis 
which could be tested by using a more exhaustive sampling 
is that the lineage A1-B1 migrated together with G2 and G3 
into Dordogne (central France) and there came into contact 
with fungal isolates from the G1 gene pool (Dutech 
et al. 2010). Migration was likely accelerated by the use of 
the viral A1-B1 lineage for biological control in France since 
1970 (Robin et al. 2010). That could explain why this viral 
lineage was transmitted to another host gene pool. A rare 
transmission event was not required because horizontal 
virus transmission can occur between gene pools (Brusini 
and Robin 2013). 

In southeastern France, the lineage A3-B3 was dominant, 
matching findings from a previous study where 78 of 79 
viral strains from this area belonged to subtype I (Robin 
et al. 2010). The most likely origin of this lineage was 
southeastern France or Italy, where C. parasitica was 
officially first reported in Europe in 1938, right after the 
radiation times estimated for clusters A3 and B3 (1912 and 
1920, respectively). These radiation times also coincide 
with the First World War, during which chestnut wood 
had been imported into Italy from North America (Grente 
1981). The A3-B3 lineage was clearly associated with the 
G1 fungal lineage, which most likely has a Japanese origin 
and was introduced into Europe via North America (Dutech 
et al. 2012). These results are in agreement with a 
genetic analysis of ORF A with CHV1 strains, showing that 
Italian strains might have originated in Japan (Liu et al. 
2007). All these data suggest that the viral A3-B3 lineage 
followed its host lineage G1 from Japan to Europe. However, 
the sticking point is that CHV1 did not establish in 
America (Milgroom and Cortesi 2004). Other Cryphonec- 
tria hypoviruses, never detected in Europe, are prevalent in 
North America but are typically much less virulent than 
CHV1 (Peever et al. 1997). The CHV1 lineage A3-B3 might 
have remained unnoticed in America, as a cryptic mycovi- 
rus, in the first years after the C. parasitica introduction, 
and then could have been vectored to Italy. Because the vc 
type diversity rapidly increased during the American chest- 
nut blight epidemic, CHV1 could not persist. In Europe, 
this recombinant lineage adapted to its new environment 
and did spread and establish in the fungal founder popula- 
tion, which had a low vc type diversity. This scenario is 
supported by the predictions of an epidemiological model, 
which explains how a feedback mechanism exerted by the 
fungal host may result in selection against highly virulent 
viruses (Brusini et al. 2011). Another hypothesis is that the 
recombination event occurred in Europe after introduc- 
tions of the ancestors in western France, followed by a 
quick and efficient spread of the recombinant strains in 
eastern France whereas the ancestors did not spread or 
disappeared. 

Both hypotheses suggest that the high prevalence of this 
recombinant lineage may be the result of adaptation and 
selection by the host and/or environment, which could also 
explain the close association between this viral lineage and 
the fungal lineage G1. Differences in fitness among CHV1 
subtypes have been reported (Peever et al. 2000). A long-term 
field study also suggested that a greater ecological fitness 
of subtype I (lineage A3-B3) compared to subtypes F1 
and F2 probably accounted for its greater invasiveness 
(Robin et al. 2010). Moreover, as fitness components can 
vary significantly depending on genotype × genotype × 
temperature interactions (Bryner and Rigling 
2011), divergent selection could have favoured specific viral 
lineages in specific host populations and environments. We 
showed clear evidence that some amino acids in ORF A 
and ORF B were subject to positive selection, although such 
protein encoding regions are generally expected to evolve 
under conservative pressures (i.e., purifying selection). 






Interestingly, positive selection was detected only for the 
A3-B3 lineage, suggesting that host adaptation could have 
occurred. 

Several recombination events and variable selection pres- 
sures contributed to CHV1 evolution, agreeing with a non- 
clock-like diversification rate. These two mechanisms may 
be at the origin of the CHV1 populations. Ancestors introduced 
in western France either gave rise to lineages which 
benefited from recombination and adapted to their new 
biotic and abiotic environment, or remained at low frequency 
in this area. Taking into account these two evolutionary 
processes is crucial for developing a biological 
control strategy against chestnut blight using CHV1. Our 
results provide evidence that divergent CHV1 lineages 
recombine and may adapt over time to establish in a new 
area. At this time, biocontrol with CHV1 relies on a 
method intermediate between a classical control method 
(using an exotic hyperparasite of an invasive pest species) 
and an augmentative method (increasing the density and 
incidence of an already naturally occurring enemy). Using 
multiple CHV1 lineages in areas where classical biological 
control is still attempted may be an option to promote 
beneficial recombinations between different lineages and 
new associations with hosts. However, the phenotypic and 
genetic diversity of CHV1 may also have an effect on 
C. parasitica management. Indeed, theoretical studies predict 
that virulence should be modified when multiple 
strains of parasites regularly compete within hosts (Franck 
1996). When releases of a hyperparasite aim at a successful 
regulation of primary parasite populations, one can wonder 
whether strains with the highest virulence are the best ones 
to use. Morozov et al. showed that mild strains of the 
hyperparasite, characterized by a high vertical transmission 
rate and low virulence, more readily establish in a host 
population than severe strains (Morozov et al. 2007). On 
the contrary, in areas where CHV1 lineages are well 
established, our results suggest that one should use adapted 
lineages, which naturally spread and which could out- 
compete non-adapted ones. 

Classical biological control relies on deliberate biologi- 
cal invasions and hence has been used to study the evo- 
lution of microorganisms introduced into new areas and 
how their invasion success may depend on demographic, 
genetic and environmental factors (Hufbauer and Roder- 
ick 2005; Roderick et al. 2012; Fauvergue et al. 2012). 
Conversely, incorporating evolutionary principles in agriculture, 
as advocated by Hendry et al. (2011), would be 
especially beneficial to control diseases with an environ- 
mentally friendly approach. Thus, insights gained in the 
scientific field of invasion biology should also be used to 
improve applied biological control. Genetic recombina- 
tion and selection events may explain the establishment 
success of invasive organisms (Facon et al. 2008). This 
evolutionary process may be enhanced if invasive species 
originate from different gene pools or hybridize 
with local genotypes or species (for example Lavergne 
and Molofsky 2007; Keller and Taylor 2010). These les- 
sons and those learned from the vigor of hybrids (Szucs 
et al. 2012) and high recombination rates in viruses (Vu- 
illaume et al. 2011; Cory and Franklin 2012) could open 
new perspectives for classical biological control. For the 
chestnut blight disease, for example, diversifying the origins 
of CHV1 might be a solution in North America, 
where repeated failures of establishment of one specific 
genotype have been reported. 